PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID KHN21019.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family B3
Protein Properties Length: 445aa    MW: 49788 Da    PI: 8.2254
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
KHN21019.1genomeTCUHKView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B358.99.2e-1933113997
                 HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE CS
          B3   9 dvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvf 97 
                 d+   + l lpk+f ++ +  k+ +  +tl+ + g  W++ +  + ++++ +++ GW++Fvk++ Lke+Df+vFk++g+s+f   v +f
  KHN21019.1  33 DYD--QHLALPKTFSDNLK--KKLPENVTLKGPGGVMWNIGM--TTRDDTLYFGHGWEQFVKDHCLKENDFLVFKYNGESQF--DVLIF 113
                 443..3499*****85555..558889***************..9********************************99999..77666 PP

2B334.83e-11309389487
                 E-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS CS
          B3   4 vltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgr 87 
                 v++p +v k+  +++  + + +h ++  +s++++l+  +  +W  +++y++ ++   lt+GWk+F  + +L+egD +vFk  g+
  KHN21019.1 309 VMKPTHVYKRFFVSIRGTWIGKHISP--SSQDVILRMGK-GEWIARYSYNNIRNNGGLTGGWKHFSLDSNLEEGDACVFKPAGQ 389
                 67888999999999999999888655..67789988855.58*******99999999***********************7665 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019362.94E-2221114IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086315.28423116IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.102.6E-2225114IPR015300DNA-binding pseudobarrel domain
CDDcd100176.14E-2025114No hitNo description
SMARTSM010198.9E-1726116IPR003340B3 DNA binding domain
PfamPF023625.2E-1535112IPR003340B3 DNA binding domain
SuperFamilySSF1019362.75E-11306391IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.102.5E-14306390IPR015300DNA-binding pseudobarrel domain
SMARTSM010190.019306404IPR003340B3 DNA binding domain
PfamPF023624.4E-9309389IPR003340B3 DNA binding domain
CDDcd100171.51E-11309402No hitNo description
PROSITE profilePS508638.839347404IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0005773Cellular Componentvacuole
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 445 aa     Download sequence    Send to blast
MVGQNCDGCR SWEEDIYWSH FQFLHFVQFL HADYDQHLAL PKTFSDNLKK KLPENVTLKG  60
PGGVMWNIGM TTRDDTLYFG HGWEQFVKDH CLKENDFLVF KYNGESQFDV LIFNGWSLCE  120
KAGSYFVRKC GHTEIDHAGG SLNKKRDTDN DSLEEGNIPS NAGVECALHE KSAHVNGTKE  180
PIDVPPETPP TENTFNAGVE SSGVEQFTPD GGVTLAAVPS ETANGKRIRN IVSAVKHVHT  240
KRKGRPAKWH VRERTLDWVA ALEAEPVSAS RSGTYEVYKS NRRPVTDDET RKIESLAKAA  300
CTDDSIYVVM KPTHVYKRFF VSIRGTWIGK HISPSSQDVI LRMGKGEWIA RYSYNNIRNN  360
GGLTGGWKHF SLDSNLEEGD ACVFKPAGQI NNTFVIDMSI FRVVPETVPL TPMSRGTRTG  420
TRTGTGRRGR KPATMKSIQT QLSSP
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00468DAPTransfer from AT4G33280Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankBT0982360.0BT098236.1 Soybean clone JCVI-FLGm-23C7 unknown mRNA.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006595618.10.0PREDICTED: uncharacterized protein LOC100813851 isoform X1
TrEMBLA0A0B2QLE60.0A0A0B2QLE6_GLYSO; B3 domain-containing protein REM16
TrEMBLK7M5I90.0K7M5I9_SOYBN; Uncharacterized protein
STRINGGLYMA14G08630.20.0(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF44462658
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G33280.18e-42B3 family protein
Publications ? help Back to Top
  1. Qi X, et al.
    Identification of a novel salt tolerance gene in wild soybean by whole-genome sequencing.
    Nat Commun, 2014. 5: p. 4340
    [PMID:25004933]